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Amendments to the Claims : 

This listing of claims will replace all prior versions, and listings, of claims in the application: 
Listing of Claims : 

1 . (currently amended) A method for distributing data in a system that includes a 
plurality of servers, the method comprising: 

identifying ones of the servers to store a replica of the data based on at least one of 
utilization of the servers, prior data distribution involving the servers, [[and]] or failure 
correlation properties associated with the servers; and 

placing the replicas of the data at the identified servers. 

2. (original) The method of claim 1, wherein the identifying ones of the servers 
includes: 

identifying underutilized ones of the servers as candidates to store the replicas of the data. 

3. (original) The method of claim 2, wherein the underutilized servers are identified 
based on disk space usage below a determined amount. 

4. (original) The method of claim 1, wherein the identifying ones of the servers 
includes: 

identifying ones of the servers that have not been involved in a recent data distribution as 
candidates to store the replicas of the data. 



-2- 



PATENT 

Application Serial No. 10/608,139 
Docket No. 0026-0029 

5. (original) The method of claim 1, wherein the identifying ones of the servers 
includes: 

identifying system conditions that affect two or more of the servers, and 
identifying ones of the servers as candidates to store the replicas of the data based on the 
identified system conditions. 

6. (original) The method of claim 1, wherein a number of the replicas of the data 
stored by the servers is user-configurable. 

7. (currently amended) A system for distributing chunks in a network that includes 
a plurality of servers, comprising: 

means for selecting ones of the servers to store replicas of the chunks based on at least 
one of utilization of the servers, prior chunk distribution involving the servers, [[and]] or failure 
correlation properties associated with the servers; and 

means for storing the replicas of the chunks at the selected servers. 

8. (currently amended) A file system, comprising: 
a plurality of servers that store replicas of chunks; and 

a master connected to the servers, the master being configured to: 

identify one or more of the servers to store a replica of a chunk based on at least 
one of utilization of the servers, prior chunk distribution involving the servers, [[and]] or 
failure correlation properties associated with the servers, and 
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place the replicas of the chunk at the identified one or more servers. 

9. (currently amended) [[A]] The method for distributing chunks of data in a system 
that includ e s a plurality of s e rv e rs that store replicas of th e chunks, th e m e thod comprising: of 
claim 1, wherein the replicas are associated with chunks of data; 

the method further comprising: 

monitoring total numbers of the replicas of the chunks available in the system; 
identifying chunks that have a total number of replicas below one or more chunk 
thresholds; 

assigning priorities to the identified chunks including at least one of: 

assigning a higher priority to one of the identified chunks whose total number of 

replicas is farther away from a corresponding one of the one or more chunk thresholds 

than another one of the identified chunks whose total number of replicas is closer to 

another corresponding one of the one or more chunk thresholds, 

determining priorities for the identified chunks based on whether the identified 

chunks are associated with active files, [[and]] or 

determining priorities for the identified chunks based on whether the identified 

chunks are blocking progress within the system; and 

re-replicating the identified chunks based substantially on the assigned priorities. 

10. (original) The method of claim 9, wherein the one or more chunk thresholds are 
user-configurable. 
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1 1 . (original) The method of claim 9, wherein the one or more chunk thresholds are 
the same for all chunks. 

12. (original) The method of claim 9, wherein the one or more chunk thresholds are 
set for each class or type of chunk. 

13. (original) The method of claim 9, wherein the assigning priorities to the 
identified chunks alternatively or additionally includes: 

determining priorities for the identified chunks based on how close the total numbers of 
replicas for the identified chunks are to the one or more chunk thresholds. 

14. (original) The method of claim 9, wherein the re-replicating the identified chunks 
includes: 

cloning a higher priority one of the identified chunks prior to cloning to a lower priority 
one of the identified chunks. 

15. (original) The method of claim 14, wherein the cloning includes: 

instructing one of the servers to copy one of the identified chunks from another one of the 

servers. 
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16. (currently amended) The method of claim 9, wherein the re-replicating the 
identified chunks includes: 

identifying ones of the servers based on at least one of utilization of the servers, prior 
data distribution involving the servers, [[and]] or failure correlation properties associated with 
the servers, and 

instructing the identified servers to copy ones of the identified chunks from other ones of 
the servers. 

17. (currently amended) [[A]] The system for distributing data in a n e twork that 
includ e s a plurality of s e rv e rs that stor e r e plicas of the data, th e system of claim 7, further 
comprising: 

means for monitoring total numbers of the replicas available in the network; 
means for identifying data chunks that has have a total number of replicas below one or 
more thresholds; 

means for prioritizing the data identified chunks , including at least one of: 

means for assigning a higher priority to data one of the identified chunks whose 
total number of replicas is farther away from a corresponding one of the one or more 
thresholds than data another one of the identified chunks whose total number of replicas 
is closer to another corresponding one of the one or more thresholds, 

means for determining priorities for the data identified chunks based on whether 
the data identified chunks are [[is]] associated with active files, [[and]] or 
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means for determining priorities for the data identified chunks based on whether 
the data is identified chunks are blocking progress within the network; and 
means for re-replicating the data identified chunks based on the assigned priorities. 

18. (currently amended) [[A]] The file system of claim 8 , comprising: 
a plurality of s e rv e rs configur e d to stor e r e plica s of chunks of data; and 
a mast e r conn e ct e d to th e s e rv e rs, wherein the master b e ing is further configured to: 
monitor total numbers of valid ones of the replicas stored by the servers, 
identify chunks that have a total number of valid replicas below one or more 
thresholds, 

assign priorities to the identified chunks by at least one of: 

assigning a higher priority to one of the identified chunks whose total 
number of valid replicas is farther away from a corresponding one of the one or 
more thresholds than another one of the identified chunks whose total number of 
valid replicas is closer to another corresponding one of the one or more chunk 
thresholds, 

determining priorities for the identified chunks based on whether the 
identified chunks are associated with active files, [[and]] or 

determining priorities for the identified chunks based on whether the 
identified chunks are blocking progress within the file system, and 
re-replicate the identified chunks based substantially on the assigned priorities. 
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19. (original) A method for redistributing chunks of data in a system that includes a 
plurality of servers that store replicas of the chunks, the method comprising: 

monitoring utilization of the servers; 

determining whether to redistribute any of the replicas; 

selecting one or more of the replicas to redistribute based on the utilization of the servers; 
selecting one or more of the servers to which to move the one or more replicas; and 
moving the one or more replicas to the selected one or more servers. 

20. (original) The method of claim 19, wherein the utilization of the servers relates to 
an amount of free disk space available at the servers. 

21 . (original) The method of claim 19, wherein the selecting one or more of the 
servers includes: 

identifying underutilized ones of the servers as candidates to which to move the one or 
more replicas. 

22. (original) The method of claim 21, wherein the underutilized servers are 
identified based on disk space usage below a determined amount. 

23. (original) The method of claim 19, wherein the selecting one or more of the 
servers includes: 
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identifying ones of the servers that have not been involved in a recent redistribution as 
candidates to which to move the one or more replicas. 

24. (original) The method of claim 19, wherein the selecting one or more of the 
servers includes: 

determining failure correlation properties associated with the servers, and 
identifying ones of the servers based on the failure correlation properties as candidates to 
which to move the one or more replicas. 

25. (original) The method of claim 19, wherein the moving the one or more replicas 
includes: 

deleting the one or more replicas from one or more of the servers, and 
instructing the selected one or more servers to copy the one or more replicas from another 
one or more of the servers. 

26. (original) A system for redistributing data in a network that includes a plurality of 
servers that store replicas of the data, the system comprising: 

means for monitoring utilization of the servers; 

means for selecting one or more of the replicas to redistribute based on the utilization of 
the servers; 

means for identifying one or more of the servers to which to move the one or more 
replicas; and 
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means for redistributing the one or more replicas to the identified one or more servers. 
27. (original) A file system, comprising: 

a plurality of servers configured to store replicas of chunks of data; and 
a master connected to the servers, the master being configured to: 

select one or more of the replicas to redistribute based on utilization of the 

servers, 

identify one or more of the servers to which to move the selected one or more 
replicas, and 

move the selected one or more replicas to the identified one or more servers. 
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